Model selection in compositional spaces
نویسنده
چکیده
We often build complex probabilistic models by composing simpler models—using one model to generate parameters or latent variables for another model. This allows us to express complex distributions over the observed data and to share statistical structure between different parts of a model. In this thesis, we present a space of matrix decomposition models defined by the composition of a small number of motifs of probabilistic modeling, including clustering, low rank factorizations, and binary latent factor models. This compositional structure can be represented by a contextfree grammar whose production rules correspond to these motifs. By exploiting the structure of this grammar, we can generically and efficiently infer latent components and estimate predictive likelihood for nearly 2500 model structures using a small toolbox of reusable algorithms. Using a greedy search over this grammar, we automatically choose the decomposition structure from raw data by evaluating only a small fraction of all models. The proposed method typically finds the correct structure for synthetic data and backs off gracefully to simpler models under heavy noise. It learns sensible structures for datasets as diverse as image patches, motion capture, 20 Questions, and U.S. Senate votes, all using exactly the same code. We then consider several improvements to compositional structure search. We present compositional importance sampling (CIS), a novel procedure for marginal likelihood estimation which requires only posterior inference and marginal likelihood estimation algorithms corresponding to the production rules of the grammar. We analyze the performance of CIS in the case of identifying additional structure within a low-rank decomposition. This analysis yields insights into how one should design a space of models to be recursively searchable. We next consider the problem of marginal likelihood estimation for the production rules. We present a novel method for obtaining ground truth marginal likelihood values on synthetic data, which enables the rigorous quantitative comparison of marginal likelihood estimators. Using
منابع مشابه
Developing a Compositional Reservoir Model for Investigating the Effect of Interfacial Tension on Oil Recovery
In this paper, a simplified formulation for compositional reservoir simulator is presented. These types of simulators are used when inter-phase mass transfer depends on phase composition as well as pressure. The procedure for solving compositional model equations is completely described. For equilibrium calculation, property estimation Peng Robinson equation of state is used. This equation ...
متن کاملEffect of CO2 Concentration in Injecting Gas on Minimum Miscibility Pressure: Compositional Model and Experimental Study
For technical and economic success of miscible gas injection projects, an accurate laboratory measurement of Minimum Miscibility Pressure (MMP) at reservoir conditions is essential. On the other hand, compositional reservoir simulator is a useful tool in gas injection studies and prediction of MMP. The main goal of this paper is to describe a procedure to generate a three phase sequential t...
متن کاملConceptual Spaces and Ontological Semantics. Compositionality in Nominal Phrases
Introduction This paper is concerned with reconciling the views of two prominent schools of meaning ascription: Logical compositional semantics in the Montague tradition, characterized by a reduction to set-theoretic constructs, and the cognitive semantics tradition as embodied in Gärdenfors’ conceptual spaces. This is achieved by adapting model-theoretic, Fregean compositional semantics to an ...
متن کاملNegative Selection Based Data Classification with Flexible Boundaries
One of the most important artificial immune algorithms is negative selection algorithm, which is an anomaly detection and pattern recognition technique; however, recent research has shown the successful application of this algorithm in data classification. Most of the negative selection methods consider deterministic boundaries to distinguish between self and non-self-spaces. In this paper, two...
متن کاملInteracting Conceptual Spaces I : Grammatical Composition of Concepts
The categorical compositional approach to meaning has been successfully applied in natural language processing, outperforming other models in mainstream empirical language processing tasks. We show how this approach can be generalized to conceptual space models of cognition. In order to do this, first we introduce the category of convex relations as a new setting for categorical compositional s...
متن کاملInteracting Conceptual Spaces
We propose applying the categorical compositional scheme of [6] to conceptual space models of cognition. In order to do this we introduce the category of convex relations as a new setting for categorical compositional semantics, emphasizing the convex structure important to conceptual space applications. We show how conceptual spaces for composite types such as adjectives and verbs can be const...
متن کامل